Phramer - An Open Source Statistical Phrase-Based Translator

نویسندگان

  • Marian Olteanu
  • Chris Davis
  • Ionut Volosen
  • Dan I. Moldovan
چکیده

This paper describes the open-source Phrase-Based Statistical Machine Translation Decoder Phramer. The paper also presents the UTD (HLTRI) system build for the WMT06 shared task. Our goal was to improve the translation quality by enhancing the translation table and by preprocessing the source language text

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Open-Source Hierarchical Phrase-Based Translation System

We present an open source translation system that provides a clean-room implementation of the hierarchical phrase-based statistical translation model introduced in (Chiang, 2005) and refined in (Chiang, 2007). To our knowledge this is the first freely available hierarchical phrase-based translation system which implements cube pruning. We introduce extensions to (Chiang, 2007) to take advantage...

متن کامل

NiuTrans: An Open Source Toolkit for Phrase-based and Syntax-based Machine Translation

We present a new open source toolkit for phrase-based and syntax-based machine translation. The toolkit supports several state-of-the-art models developed in statistical machine translation, including the phrase-based model, the hierachical phrase-based model, and various syntaxbased models. The key innovation provided by the toolkit is that the decoder can work with various grammars and offers...

متن کامل

Interactive Machine Translation Based on Partial Statistical Phrase-based Alignments

State-of-the-art Machine Translation (MT) systems are still far from being perfect. An alternative is the so-called Interactive Machine Translation (IMT) framework. In this framework, the knowledge of a human translator is combined with a MT system. We present a new technique for IMT which is based on the generation of partial alignments at phrase-level. The proposed technique partially aligns ...

متن کامل

Spanish-Catalan Translator Using Statistical Methods

The development of a Spanish-Catalan statistical machine translation system is described in this paper. The methodology used attempts to solve the problem using a purely inductive approach, without using linguistic knowledge. To obtain the translator, we perform the following steps: First, we obtain a bilingual corpus from Internet. Second, we fragment the corpus into units (sentences and token...

متن کامل

Jane 2: Open Source Phrase-based and Hierarchical Statistical Machine Translation

We present Jane 2, an open source toolkit supporting both the phrase-based and the hierarchical phrase-based paradigm for statistical machine translation. It is implemented in C++ and provides efficient decoding algorithms and data structures. This work focuses on the description of its phrase-based functionality. In addition to the standard pipeline, including phrase extraction and parameter o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006